Amino acid coupling patterns in thermophilic proteins.

نویسندگان

  • Han-Kuen Liang
  • Chia-Mao Huang
  • Ming-Tat Ko
  • Jenn-Kang Hwang
چکیده

Structural analysis is useful in elucidating structural features responsible for enhanced thermal stability of proteins. However, due to the rapid increase of sequenced genomic data, there are far more protein sequences than the corresponding three-dimensional (3D) structures. The usual sequence-based amino acid composition analysis provides useful but simplified clues about the amino acid types related to thermal stability of proteins. In this work, we developed a statistical approach to identify the significant amino acid coupling sequence patterns in thermophilic proteins. The amino acid coupling sequence pattern is defined as any 2 types of amino acids separated by 1 or more amino acids. Using this approach, we construct the rho profiles for the coupling patterns. The rho value gives a measure of the relative occurrence of a coupling pattern in thermophiles compared with mesophiles. We found that thermophiles and mesophiles exhibit significant bias in their amino acid coupling patterns. We showed that such bias is mainly due to temperature adaptation instead of species or GC content variations. Though no single outstanding coupling pattern can adequately account for protein thermostability, we can use a group of amino acid coupling patterns having strong statistical significance (p values < 10(-7)) to distinguish between thermophilic and mesophilic proteins. We found a good correlation between the optimal growth temperatures of the genomes and the occurrences of the coupling patterns (the correlation coefficient is 0.89). Furthermore, we can separate the thermophilic proteins from their mesophilic orthologs using the amino acid coupling patterns. These results may be useful in the study of the enhanced stability of proteins from thermophiles-especially when structural information is scarce. Proteins 2005. (c) 2005 Wiley-Liss, Inc.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Patterns of temperature adaptation in proteins from Methanococcus and Bacillus.

It has long been known that amino acid substitutions in proteins of organisms living at moderate and high temperatures (mesophiles and thermophiles, respectively) are not all symmetrical; for example, more aligned sites have lysine in mesophiles and arginine in thermophiles than have the opposite pattern. This is generally taken to indicate that certain amino acids are favored over others by se...

متن کامل

Patterns of temperature adaptation in proteins from the bacteria Deinococcus radiodurans and Thermus thermophilus.

Asymmetrical patterns of amino acid substitution in proteins of organisms living at moderate and high temperatures (mesophiles and thermophiles, respectively) are generally taken to indicate selection favoring different amino acids at different temperatures due to their biochemical properties. If that were the case, comparisons of different pairs of mesophilic and thermophilic taxa would exhibi...

متن کامل

A Novel Statistical Method for Thermostable Proteins Discrimination

In this study, we used features that can be extracted from protein sequences to discriminate mesophilic, thermophilic and hyper-thermophilic proteins. Amino acid frequency, dipeptide amino acid frequency and physical-chemical features are used in this study. The effect of mentioned features on proposed discrimination algorithm was evaluated both separately and in combination. Statistical method...

متن کامل

Identification of thermophilic species by the amino acid compositions deduced from their genomes.

The global amino acid compositions as deduced from the complete genomic sequences of six thermophilic archaea, two thermophilic bacteria, 17 mesophilic bacteria and two eukaryotic species were analysed by hierarchical clustering and principal components analysis. Both methods showed an influence of several factors on amino acid composition. Although GC content has a dominant effect, thermophili...

متن کامل

Internal correspondence analysis of codon and amino-acid usage in thermophilic bacteria.

Starting from two datasets of codon usage in coding sequences from mesophilic and thermophilic bacteria, we used internal correspondence analysis to study the variability of codon usage within and between species, and within and between amino acids. The first dataset included 18,958,458 codons from 58,482 coding sequences from completely sequenced genomes of 25 species, along with 6,793,581 din...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proteins

دوره 59 1  شماره 

صفحات  -

تاریخ انتشار 2005